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Monitoring protein-protein interactions in living cells is key to unraveling their roles in numerous cellular 
processes and various diseases. Previously described split-GFP based sensors suffer from poor folding and/ 
or self-assembly background fluorescence. Here, we have engineered a micro-tagging system to monitor 
protein-protein interactions in vivo and in vitro. The assay is based on tripartite association between two 
twenty amino-acids long GFP tags, GFP10 and GFP11, fused to interacting protein partners, and the 
complementary GFP 1-9 detector. When proteins interact, GFP 10 and GFP 11 self- associate with GFP 1-9 to 
reconstitute a functional GFP. Using coiled-coils and FRB/FKBP12 model systems we characterize the 
sensor in vitro and in Escherichia coli. We extend the studies to mammalian cells and examine the FK-506 
inhibition of the rapamycin-induced association of FRB/FKBP12. The small size of these tags and their 
minimal effect on fusion protein behavior and solubility should enable new experiments for monitoring 
protein-protein association by fluorescence. 

Signal transduction and gene expression often involves protein -protein interactions. Optical tracking of 
these interactions can help to illuminate regulatory mechanisms and identify aberrant processes in diseases. 
Fluorescent protein biosensors have been developed to measure protein complexes in living cells 1 . For 
example, bioluminescence resonance energy transfer (FRET and BRET) enable dynamic observations of protein- 
protein interactions 2 . More recent developments include protein-fragment complementation assays (PCA) that 
monitor protein-protein interactions by reconstitution of fragments of various enzymes split into two pieces (as 
fused tags on passenger proteins) including fragments of dihydrofolate reductase 3 , P-galactosidase 4 , P-lactamase 5 , 
and the firefly and Gaussia luciferases 6 . Though sensitive, these have the limitation that the observed products 
diffuse away from the protein interaction site. PCA based on split green fluorescent protein (GFP) and color 
variants 7 is termed bimolecular fluorescence complementation (BiFC). BiFC relies on (i) interactions between 
bait and prey proteins that bring together two non-fluorescent split protein domains and (ii) subsequent co- 
folding into the P-barrel structure to form the chromophore 8 . Because assembly of the GFP fragments is irre- 
versible, the stability of the BiFC enables integration, accumulation, and subsequent detection even of transient 
interactions and low affinity complexes 9,10 . These remain attached to the interacting proteins, enabling tracking of 
the complex. However, improving existing BiFCs based on large and bulky fragments is challenging as increasing 
BiFC fragment solubility and folding can increase background signals from spontaneous assembly of the fluor- 
escent protein fragments 11 . For example, BiFC fragments 12 obtained by fragmentation of folding-reporter GFP 
(FR-GFP) 13 and superfolder GFP (sfGFP) 14 at permissive sites 156 and 173 were aggregation -prone and had high 
backgrounds from self-assembly (Supplementary Fig. SI). Using a smaller tag can reduce aggregation and folding 
interference. For example, one split GFP uses a very small 15 amino acid tag for quantification of soluble protein 
and tracking proteins in vivo 15 ' 17 . However the large size of the remaining piece and self-assembly of the fragments 
makes this system less suited for protein-protein interaction studies. We had previously developed a sponta- 
neously assembling split GFP based on GFP 10-11 (residues 194-233, the tag) and GFP1-9 (residues 1-193, the 
detector) (Waldo lab, unpublished results). Here we describe the further engineering of this split GFP to yield an 
entirely new protein interaction assay based on the association of three fragments of the GFP: two short peptides 
GFP10 (residues 194-212) and GFP11 (residues 213-233) each tagged to one of the interacting partners, and a 
third large GFP 1-9 (residues 1-193) detector fragment (Fig. 1). We present the characterization of the assay using 
attractive and repulsive pairs of charged coiled-coils peptides 18 and the rapamycin mediated heterodimerization 
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Figure 1 | Principle of the tripartite split-GFP complementation assay. (3- 
strand 10 (GFP10) and (3-strand 1 1 (GFP1 1) are fused to bait (A) and prey 
(B) proteins, respectively and the detector fragment GFP 1-9 (|3-strand 1- 
9) is added separately. When protein interaction occurs, GFP 10 and GFP1 1 
are tethered and then spontaneously associate with GFP 1-9 fragment to 
form a full-length GFP. If proteins A and B do not interact, GFP 10 and 
GFP1 1 are not tethered and entropy is too high to allow complementation 
with GFP 1-9. 

of FK506 binding protein (FKBP) and FKBP12-rapamycin binding 
domain (FRB) 19 . The assay correctly reports the localization of pro- 
tein-protein complexes in mammalian cells with very low back- 
ground fluorescence levels. The chief advantage over BiFC is the 
small size of the tags (ca. 20 amino acids), and the concomitant 
reduced interference with passenger protein folding. The tripartite 
split GFP assay may find broad utility in the visualization of protein 
complexes specially where bulkier BiFC fragments might impede 
localization or interfere with folding. 



Results 

Engineering a three-body split-GFP system for improved solubi- 
lity and complementation. We previously identified self- assembling 
split-GFP fragments corresponding to the C-terminal p-hairpin 
GFP10-11 of sfGFP 14 (residues 194-238) and the large fragment 
GFP1-9 (residues 1-193) (Fig. 2). In order to improve GFP 1-9 
folding efficiency, we performed two rounds of directed evolution 
and obtained a new variant of GFP 1-9 named GFP 1-9 Ml that 
contained five additional mutations relative to sfGFPl-9 (Fig. 2, 
Supplementary Note). We converted this bimolecular pair into a 
three-body split-GFP in a stepwise manner. First, in attempt to 
destabilize the GFP10-11 P-hairpin self-assembly, we inserted a 
long flexible linker that included a cloning site between GFP 10 and 
GFP 11 and evolved the whole cassette for improved solubility and 
complementation efficiency with GFP 1-9 Ml (See Methods). Next, 
we spaced out GFP 10 and GFP 11 further by inserting a partially 
soluble bait protein, hexulose phosphate synthase or HPS 15 in the 
cloning site of the linker. HPS alone was —60% soluble and was used 
in our earlier study as a bait protein to drive the evolution of more 
soluble split-GFP tags 15 (Supplementary table 1 and Supplementary 
Note). In the final step, we evolved GFP 1-9 Ml to recognize this 
"sandwich" bait topology. When complemented with the optimized 
GFP10 and GFP1 1 tags attached to the N and C termini of the HPS 15 , 
the resulting variant named GFP 1-9 OPT, exhibited a 40-fold 
improvement in the rate of formation of fluorescence (Fig. 3a, 
black line) relative to GFP 1-9 Ml (Fig. 3a, gray line). Differences 
between the GFP 1-9 Ml were less obvious when complemented with 
the GFP 10- 11 hairpin (displayed on permissive loop of a cherry 
fluorescent protein, Waldo lab, manuscript under review) (Fig. 3a, 
right panel), presumably due to the reduced entropy of the GFP 10- 
11 hairpin relative to the GFP10-HPS-GFP11 "sandwich" format. 
GFP 1-9 OPT amino-acid sequence revealed the presence of four 
additional mutations relative to GFP 1-9 Ml (Fig. 2). We then 




O GFP1-9M1 
O GFP1-9 OPT 

Figure 2 | Secondary structure diagram of GFP 1-9 variants with corresponding mutations. Superfolder GFP 1-9 (green dots), GFP 1-9 Ml (yellow), 
and GFP1-9 OPT (orange). 
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Figure 3 | Complementation of the tripartite split-GFP in vitro and in vivo, (a) Complementation curves of GFP1-9 Ml (gray line) and GFP1-9 OPT 
(black line) with equimolar GFPlO-sulfite reductase (SR)-GFPl 1 fusion protein (left) or GFP10-1 1 hairpin domain displayed on a permissive loop of a 
superfolder red fluorescent protein (right) (Waldo lab, manuscript under review), (b) In vivo solubility screen of 18 Pyrobaculum test proteins 15 expressed 
in a "sandwich" configuration with N-terminal GFP10 and C-terminal GFP11 (GFP10-A-GFP11, A = protein of interest) from pTET-GFP10/ll 
plasmid assayed with either GFP1-10 (top) or GFP1-9 (bottom) expressed from a pET vector in BL21(DE3) E. coli. 15 Fluorescence pictures of E. coli 
colonies on plates after 1-1/2 h Antet induction followed by IPTG induction (sequential induction) (SEQ) or 3 h co-induction (CO). Legend indicates of 
the tagged proteins determined by SDS-PAGE (soluble (green), partially soluble (yellow), insoluble (red)). 



examined whether optimized GFP10 and GFP11 would affect the 
solubility of fused proteins. We cloned set of eighteen P. aero- 
philum test proteins into a modified pTET vector as GFP10-POI- 
GFP11 "sandwiches" (POI = protein of interest) to assay solubility 
levels as previously described 16 . We had previously measured the 
solubility of these eighteen controls without tags and shown that 
the GFP11 M3 tag engineered in the earlier study did not perturb 
passenger solubility 16 . We first assayed the sandwich fusions using 
GFP1-10 to detect only the GFP11 tag (Fig. 3b). Co-expressing 
GFP1-10 and the sandwich resulted in bright colonies reflecting 
total protein expression (Fig. 3b, top). On the other hand, 
expressing the sandwich first, shutting off expression to allow the 
proteins to remain soluble or aggregate, then expressing the GFP1- 
10 detector (sequential induction) gave weak fluorescence for the 
protein expressed in insoluble form (proteins #6,8,15,16) as 
expected. This pattern was very similar to that previously seen for 
the GFP11 M3 tag study 16 indicating that the GFP10-POI-GFP11 
sandwich did not strongly perturb the solubility of the test proteins. 
Note that the version of GFPS1 1 developed in this manuscript differs 
slightly from the GFP11 M3 tag described earlier 16 . Next we studied 
the complementation of the sandwich with GFP1-9 OPT (subse- 
quently referred to as GFP1-9) (Fig. 3b, bottom). Interestingly, whe- 
ther the sandwich and GFP1-9 were co-expressed or sequentially 
expressed, only the soluble proteins gave fluorescent colonies (Fig. 3b, 
bottom). When GFP1-10 is co-expressed, insoluble protein can be 
labeled (Fig. 3b, top), ostensibly because the available GFP1-10 can 
capture the GFP11 M3 tag early on, committing the fluorophore to 



form regardless of the subsequent fate of the fusion. In contrast, even 
when co-expressed with GFP1-9, one or both tags of insoluble 
proteins must become inaccessible prior to productive binding 
with GFP1-9. This suggests the GFP 1-9 could be used to strin- 
gently assess protein solubility, without the concern of false - 
positives caused by capturing transiently soluble proteins as with 
GFP1-10. 

Validating a protein-protein interaction sensor using coiled-coil 
heterodimerization. To test whether GFP 1-9 is capable of detecting 
GFP 10 and GFP 11 fused to separate interacting protein molecules, 
we selected two pairs of coiled-coils developed by Hodges and 
coworkers 18 based on their ability to direct preferential interaction 
between oppositely charged Kl/El coiled-coils or repulsion for the 
negatively charged El /El pair (Supplementary Fig. S2). This model 
system features rapid association of the K/E heterodimer and a lack 
of E coil homodimerization 18 . Pair-wise combinations of E1-GFP11 
and GFP10-E1, or GFP10-K1 and E1-GFP11 were expressed from 
an anhydrotetracycline (Antet) inducible bicistronic pTET vector 
(Supplementary Fig. S3), and transformed into E. coli cells ex- 
pressing the GFP 1-9 from an isopropyl P-D-l-thiogalactopyra- 
noside (IPTG) inducible pET vector (Fig. 4a). After 3 h E. coli 
GFP1-9 cells co-expressing GFP10-K1 and E1-GFP11 turned 
brightly fluorescent, thus demonstrating efficient K/E coiled-coil 
heterodimerization. In contrast, co-expression of GFP10-E1 and 
E1-GFP11 produced residual background fluorescence, com- 
parable with mock induction after either Antet or IPTG induction 
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Figure 4 | Characterization of protein-protein interactions using coiled-coil heterodimerization. (a) Expression vectors for tripartite split-GFP 
interaction assays in E. coli: bicistronic pTET tetracycline inducible vector harboring GFP10 and GFP11 tags with cloning sites for test proteins; IPTG 
inducible pET vector (T7 promoter) for stable expression of GFP1-9 OPT. (b) Colony fluorescence on plates after Antet and IPTG co-induction driving 
expression of GFP10-K1/E1-GFP11 pair (K/E) (top) and GFP10-E1/E1-GFP11 (El/El) (bottom) with GFP1-9 detection fragment (c) In vitro 
quantification of Kl/El or El/El interaction with increasing amount of E-GFP11 (0.4 nM up to 800 nM). Fluorescence values 1 h after initiating 
complementation with GFP1-9. (d) Initial velocity rates of complementation reactions with various concentrations of GFP1-9 detection reagent (0.5 uM 
up to 8 uM) in GFP10-K1/E1-GFP11 assay (1 uM of each tagged species). 



in E. coli (Fig. 4b). These results suggest minimal GFP10 and GFP11 
self-association in the absence of interacting fused proteins. Using 
the Kl/El model, we also chose the GFP10 construct (residue 194- 
212) as the optimal 'break point' for detecting protein-protein 
interactions (Supplementary Fig. S4). To assess the sensitivity of 
the assay, GFP10-K1, GFP10-E1, and E1-GFP11 fusion proteins 
were individually expressed from pET vectors in E. coli and puri- 
fied using immobilized metal affinity chromatography (Supplemen- 
tary Fig. S5a). Quantified GFP10-K1 or GFP10-E1 were mixed with 
E1-GFP11 at varying concentrations, and supplemented with a 
molar excess of the GFP1-9 fragment (8 |iM) to initiate rapid 
complementation and ensure that kinetic rate is indepen- 
dent of GFP1-9 concentration. Fluorescence of the GFP10-K1/E1- 
GFP11 interaction assay was linearly related to the concentration of 
E1-GFP11 substrate, as previously observed with self-assembling 
split-GFP assays 15 (Figure 4c). The assay could detect Kl/El coil 
interaction down to 1 nM which corresponds roughly to the 
dissociation constant of the heterodimer 18 . As expected, the El /El 
pair interaction signal remained at a basal level of fluorescence even 
at very high concentrations (1 mM) (Fig. 4c). These results are 
completely consistent with a model in which tripartite split-GFP 
complementation occurs only when GFP10 and GFP11 are 
brought into close proximity by interacting protein partners, 
presumably by reduction in entropy. To study how different 
concentrations of the detecting reagent (GFP1-9) affected the 



assay, two-fold successive dilutions of GFP1-9 were added to the 
protein mixtures containing GFP10-K1 and E1-GFP11 at the same 
final concentration (1 |iM each). We observed a slower kinetic 
complementation rate as the reagent concentration decreased, com- 
pletely consistent with simple bimolecular reaction kinetics (Fig. 4d). 
For example, the initial rate for appearance of fluorescence drops by a 
factor of two each time the concentration of GFP1-9 is halved. 
Although further studies are needed to understand the precise 
mechanism of assembly, these observations are consistent with a 
model in which the coils bring the smaller fragments into proxi- 
mity prior to detection by GFP1-9, i.e., approximating a two-body 
system with the tethered GFP10 and GFP11 acting as a module 
interacting with GFP1-9. 

Inducible FRB/FKBP interactions. To further test the specificity and 
dynamic range of the method, we used the well-studied FKBP12-FRB 
rapamycin inducible protein interaction 19 . We fused GFP10 to the 
rapamycin-binding domain of the mammalian target of rapamycin 
(mTOR) kinase (FRB) and GFP11 to the FK506-binding protein 12 
(FKBP) and tested their association in vitro upon addition of recom- 
binant GFP 1-9 protein (Fig. 5a). GFP10-FRB and FKBP-GFP11 
fusion proteins were co- expressed from bicistronic pTET GFP 10- 
IRBS-GFP11 vector (Supplementary Fig. S3). In earlier studies of 
interacting proteins in cell extracts using BiFC, misfolding and 
aggregation issues interfered with complementation unless the 
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Figure 5 | Application of the tripartite split-GFP to study the ramapycin inducible FRB/FKBP interaction, (a) GFP 1 0 and GFP 1 1 tags were fused to FRB 
and FKBP proteins. Rapamycin ligand binding brings both protein fusions into proximity, permitting GFP fluorescence reconstitution upon addition of 
GFP1-9. (b) Raw fluorescence progress curves for GFP1-9 complementation with soluble extracts of GFP10-FRB and FKBP-GFP11 fusions in presence 
( + RAP) or absence ( — RAP) of rapamycin (starting time marked by addition of rapamycin to initiate complementation) (c) Fluorescence levels of 
GFP10-FRB and FKBP-GFP11 assayed at various concentrations (320, 160, and 40 nM in each) initiated by addition of rapamycin to 150 nM final 
concentration (black bar) and no rapamycin (gray bar), (d) Rapamycin dose curve (0.6 to 300 nM) for FRB/FKPB binding in vitro measured as final 
fluorescence after 1 h. 
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proteins had been co -expressed or were refolded after denaturing 8 . In 
contrast, our soluble crude E. coli cell extracts could be mixed with a 
4 uM solution of purified GFP1-9 and incubated for several minutes 
prior to addition of 150 nM rapamycin, after which the rapid increase 
in fluorescence indicated soluble interacting GFP10-FRB and FKBP- 
GFP11 protein. A two-fold increase in fluorescence was easily 
detectable after only 10 minutes incubation (Fig. 5b, inset), with a 
half-life of 60 minutes. Without added rapamycin fluorescence 
remained near blank levels (Fig. 5b). To further examine the 
behavior of the protein interaction reporter, we studied purified 6- 
His-GFPIO-FRB and 6-His-FKBP-GFPll fusion proteins (Supple- 
mentary Fig. S5b). In the presence of 150 nM rapamycin (much 
greater than the reported Kd of ca. 12 nM 19 ), endpoint fluorescence 
was directly proportional to the amount of added protein complex as 
expected (Fig. 5c). To further evaluate reaction kinetics and equilibria 
of rapamycin- FKBP/FRB protein association, we tested the effect of 
increasing concentrations of rapamycin (0.05 to 300 nM). The 
rapamycin dose response curve presents a half maximal value of 
13.5 nM, in accordance with previously published data 19 (Fig. 5d). 

Monitoring the association of protein complexes in E. coli. Protein 
signaling modules often involve more than two interacting partners. 
To examine the ability of our tagging system to detect multimeric 
complexes, we selected two polycistrons from the E. coli genome 
encoding the heterotrimer Tus BCD complex (YheNML) 20 and a 



putative allophanate hydrolase constituted by the dimeric assembly 
YBGK/YBGJ 21 . We also studied an alternate version of the Tus 
complex where YheN subunit was omitted, thus preventing stable 
association of the complex as previously reported 20 . These 
polycistrons were PCR amplified from genomic DNA including 
their internal ribosome binding sites. Translation of individual 
complex subunits was dependent on natural IRBS present in the 
polycistron, so the first subunit had an N-terminal GFP10 and the 
last subunit had a C-terminal GFP11 furnished by the pTET SpecR 
vector (Fig. 6a). To ensure sufficient flexibility between complex 
subunits, especially for the YheNML whose subunits have nearly 
buried N-termini in the assembled complex 20 , 30 and 25-mer 
linkers were inserted between the polycistron and the split- GFP 
tags (Fig. 6b and Supplementary Fig. S3). These constructs were 
transformed into E. coli cells expressing either the GFP 1-9 
fragment for complex detection (requiring both the GFP 10 and 
GFP 11 tag), or the GFP 1-10 fragment 16 to assess the expression 
level and solubility by monitoring the GFP 11 tag (Fig. 6b and 6c). 
All the complexes were well expressed and highly soluble as shown in 
Fig. 6c, column A, B. Cells expressing the trimeric YheNML complex 
became fluorescent after 4h complementation with GFP 1-9, as 
expected (Fig. 6c, ID). In contrast, weak fluorescence was seen for 
colonies expressing only YheM and YheL (missing the YheN 
subunit) with GFP 1-9 (Fig. 6c, 2D). Colonies expressing the 
YBGJ/YBGK control complex were brightly fluorescent after 
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Figure 6 | Monitoring complex formation and stability in E. coli. (a) Analysis of YheNML heterotrimeric complex formation using split-GFP. GFP1-9 
complementation provides information on interaction of GFP10 and GFP11 tagged domains, while GFP1-10 binds to GFP11 tagged protein, thus 
providing expression and solubility levels of the GFP 11 tagged protein or complex, (b) E. coli polycistrons YheNML, YheML and YBJG/K were 
subcloned between GFP10 and GFP11 strands with extended linkers (/25 and /30) into pTET GFP10/11 tet inducible plasmids. (c) Split-GFP assay in 
E. coli cells expressing either GFP 1-9 or GFP 1-10. Sequential or co-induction was performed as previously described. Corresponding semi- quantitative 
measure of colony fluorescence using NIH Image (right). 



SCIENTIFIC REPORTS | 3 : 2854 | DOI: 1 0.1 038/srep02854 



6 



GFP DAPI Overlay 



GFP DAPI Overlay 



Z-11 




■ 


10-Z/Z-11 






Z-11+ 1-10 


■ 


H 



50 



8 40 

§ 30 
to 

§ 20 

=3 



10 



Ku70-11 






10-Ku70/Ku80-11 




O 


Ku70-11+ 1-10 








n JQ 



1,000 * ™, 
c c 
CD g 

.? ^ 

<D =3 
O Q_ 

C o 
go. 

8 1 



800 



600 



400 



200 



o 

LL S 
r- <Q 



-RAP 



DAPI 



GFP 



10 



20 



50 



• RAP 



FKBP-11 
+ 1-10 


FKBP-11 
+10-FRB 
(GFP1-9) 


ft* 

* \ 






EE 















100 [Rapamycin] (nM) 



Rap 20nM 




10 

[Rapamycin] (nM) 



0.01 
[FK-506] (mM) 



Figure 7 | Visualization of complex formation in mammalian cells, (a) Leucine zipper and Ku70/80 heterodimerization. Cells transiently expressing 
GFP 1-9 with GCN4 zipper (Z-11) (left panel) or Ku70-GFP11 (Ku70-ll) (right panel) display no background fluorescence. Heterodimerization is 
visualized as fluorescence with interacting leucine zippers GFP10-Z and Z-GFP1 1 in CHO cells expressing GFP 1-9. Complementation of GFP1 1-Z with 
GFP1-10 confirms localization of the zipper alone (bottom). Ku70/80 complex formation is visualized in cell nuclei (right panel). Expression of one Ku 
component tagged with GFP11 is monitored with GFP1-10. FACS analysis of HEK 293_GFP 1-9 cell lines transfected with corresponding constructs 
(24 h after transfection). Percentage of fluorescent cells (black bars); mean fluorescence intensity of the positive cell population (gray bars). Self- 
associating GFP10-Z-GFP1 1 domain is used as positive control of transfection and complementation with GFP1-9 (mean ± SD; N = 3). (b) Rapamycin 
induced FRB/FKBP interaction in mammalian cells. Stable HEK 293 cells expressing GFP 1-9 were co-transfected with GFP10-FRB and FKBP-GFP11 
constructs and stimulated with increasing concentrations of rapamycin (RAP) (0, 10, 20, 50 and 100 nM). Bipartite complementation of FKBP-11 and 
GFP 1-10 is shown in the most left image; Green fluorescence at 488 nm excitation (GFP), DAPI nuclear staining (cyan). Scale bars =10 um. Bottom 
panel: FACS quantification of rapamycin induced association with or without addition of competitive inhibitor FK-506 ( 1 uM). Right graph: addition of 
increasing concentrations of FK-506 (20 nM rapamycin; 0.01, 0.1 and 1 uM FK-506) (mean ± SD; N = 3). 
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complementation with GFP1-9 as expected (Fig. 6c, 3D). These 
results are in good agreement with those obtained in our previous 
bead-based assays using GFP1-10 complementation 21 . 

Visualization of protein-protein interactions in mammalian cells. 

We adapted the split- GFP fragments for optimized expression in 
mammalian cells and tested the formation of several eukaryotic 
complexes by fluorescence microscopy. As proof of principle, we 
first fused the yeast GCN4 leucine zipper 22 to each of the GFP 10 
and GFP 11 tags and co-transfected both constructs along with the 
GFP 1-9 detector fragment into CHO cells to study the formation of 
GCN4 heterodimer (Fig. 7a, left panel). Nuclear fluorescence 
corresponding to leucine zipper heterodimerization was observed 
in cells expressing GFP 1-9 after 24 h of transfection (Fig. 7a, lane 
2). As expected, co-expression of GFP 1-9 and one of the leucine 
zipper domain C-terminally fused to GFP1 1 did not produce fluore- 
scence (Fig. 7a, lane 1). In a separate experiment we observed the 
fluorescence complementation of the GFP 11 tagged proteins with 
GFP 1-10, which measures soluble protein levels in cells 15 . As 
expected, the leucine zipper domain alone was strongly localized in 
the nucleus and only weakly in the cytoplasm (Fig. 7a, lane 3). We 
were also able to detect much larger protein complexes, such as the 
Ku70-Ku80 complex 23 that is required in the non-homologous end- 
joining (NHEJ) pathway in DNA repair. We successfully detected 
heterodimerization between GFP10-Ku80 and Ku70-GFP11 in 
HEK 293 cells expressing stably the GFP 1-9 fragment (Fig. 7a, 
right panel). To shed additional light on possible back- 
ground from the tripartite assay in living cells, we co-expressed 
leucine- zipper and Ku proteins that localize in the same sub 
cellular compartment. Quantification of the fluorescence levels by 
flow activated cell sorting (FACS) indicated basal levels of fluore- 
scence from individual split-GFP controls and between non- 
interacting proteins tested. Pairwise leucine zipper and Ku subunit 
heterodimerization led to fluorescence levels comparable to the 
bipartite split-GFP assay, which titrates soluble protein levels in 
cells 15 (Fig. 7a). In parallel, we co-expressed GFP10-FRB and 
FKBP-GFP11 protein fusions in HEK 293_GFPl-9 cells, stimu- 
lated with increasing concentrations of rapamycin. Fluorescence 
levels for the unstimulated cells remain basal, whereas bright 
fluorescent cells could be visualized with as little as 10 nM rapamy- 
cin, in the range of the EC 50 value measured for the ternary complex 
in vitro 24 (Fig. 7b). Moreover, pre-addition of an excess of 
competitive inhibitor FK-506 totally prevented FRB/FKBP associa- 
tion by rapamycin in an FK-506 dose dependent manner (Fig. 7b, 
bottom panel), thus demonstrating that tripartite split-GFP 
complementation can be used to monitor inhibitors of protein- 
protein interactions by small molecule compounds prior to 
association of interacting partners. 

Discussion 

Here we describe the first protein -protein interaction reporter based 
on tripartite split-GFP association. Instead of the bulky and poorly 
folded BiFC fragments of GFP 12 ' 25 , our tags based on small engi- 
neered P-strands of GFP (—20 amino acids long) minimize protein 
interference and aggregation. Interaction assays using the E-coil/ 
K-coil model 18 have a sensitivity limit in the picomole range. 
Chemically induced interactions of the FRB/FKBP complex are 
detectable using the split-GFP within a few minutes after addition 
of rapamycin in vitro. Bulky BiFC fragments from various fluor- 
escent proteins are expressed largely in E. coli in inclusion bodies 8 , 
and the effect of the fragments from other enzyme-based PCA assays 
on the folding of fused proteins is poorly understood 26 . Here we show 
that our tagging system is highly soluble, permitting production of 
high yield of fusion proteins at 37°C in E. coli. The assay is therefore 
not temperature dependent, unlike other BiFC for which co-express- 
ion and decreasing growth temperature, or refolding denatured 



extracts are the only ways to improve assembly of split-GFP frag- 
ments 8,27 . We have shown that the method is sensitive enough to 
detect association of proteins expressed from E. coli polycistronic 
mRNAs, and fluorescence is correlated with independently 
measured protein complex formation and stability. As shown for 
the TusABC (YheNML) studies above, we could monitor the forma- 
tion of larger trimeric complexes by extending the length of the 
linkers between the GFP tag and the protein of interest. This suggests 
that precise geometry of the GFP 10 and GFP 11 is not critical for 
interaction with GFP 1-9, but rather that reduction of entropy is 
important for triggering assembly of the GFP 10, GFP 11, and 
GFP1-9. 

The newly described tripartite split-GFP assay is a promising tool 
to study protein-protein interactions in vitro and in living cells. Its 
chief advantage over the bulky aggregation -prone fragments of exist- 
ing BiFC is the small sizes of the GFP 10 and GFP1 1 tagging peptides. 
It is therefore well suited to study interactions of unstable protein 
complexes that are difficult to detect with larger GFP tags. Our sys- 
tem greatly expands the prospect for protein-interaction screening 
and the design of new biosensors for protein complex assembly and 
association 28 . Although association of the 3 -body GFP fragments is 
currently irreversible, this system can be exploited to turn on the 
detection of protein complex formation by simple addition of 
GFP 1-9 reagent. The method is particularly well adapted to high- 
throughput interaction screens of libraries of protein-protein inter- 
faces and domains 29 , the study of complex formation stability, and 
the robust detection of soluble proteins and protein complexes by 
flow cytometry. The technology should prove useful for screening 
small molecule compounds, for example inhibitors for the interfaces 
of macromolecular protein complexes. Further work might yield 
split tripartite GFPs whose assembly can be regulated with light 30 . 

Methods 

Cloning. Kl-coil and El -coil DNA, FRB (NM_004958) and FKBP (NMJ300801) 
sequences were amplified by PCR using synthetic oligonucleotides (SI note) and 
cloned into a pTET ColEl SpecR GFP10/11 via NdehKpnl (GFP10 fusion) or 
SpekBamHl (GFP11 fusion). Linker extension of pTET GFP10/11 was performed 
using inverse PCR with synthetic oligonucleotides (Supplementary Fig. S3). GFP1-9 
was cloned into a pET28a pl5 Kan vector via NdehBamHl sites. Proteins expressed 
with GFP 10 (El and Kl coils, FRB) and proteins expressed with GFP 11 (El coil, 
FKBP) were subcloned into a pET vector bearing a C-terminal or N-terminal 6-His 
tag respectively, prior to purification and characterization. For mammalian 
expression, all the constructs used in the study were derived from pcDNA 3.1 Zeo 
vector backbone encoding GCN4-hGLucl and hLuc2-GCN4 kindly provided by 
Stephen Michnick. Split-luciferase domains were replaced with GFP 10 and GFP 11 
fragments. Ku80, FRB were cloned into BspehXbal sites of pcDNA_GFP10; Ku70 and 
FKBP were amplified from cDNA and inserted into NothClal cloning sites of 
pcDNA_GFPll vector. 

Directed evolution of split-GFP fragments for tripartite complementation. The 

DNA construct encoding (GFP10)-ll(DVGSGGGS)-NdeI::GGGSGSGG::BamHI-l2 
(GGGSGGGS)-(GFPll), where GFP10 and GFP11 sequences were derived from 
superfolder GFP (Supplementary Note), was evolved by DNA shuffling. Libraries of 
GFP 10- 11 variants expressed from pTET SpecR vector were screened for improved 
solubility using a sequential induction protocol 16 in E. coli cells containing GFP 1-9 
Ml on a pET pl5 vector. At each round, protein solubility of selected optima was 
verified by complementation with GFP 1-10 in vitro. From the six brightest clones 
sequenced after three rounds of evolution, one best mutant (#5) was identified, 
termed GFP11 M4 (Supplementary Note). Upstream GFP10 fragment (GFP10 Ml) 
was further evolved by DNA shuffling and primer doping mutagenesis with a pool of 
fourteen synthetic oligonucleotide primers. Each primer was centered at one of the 
fourteen amino acids of the GFP 10 Ml domain, containing an NNN coding 
degeneracy at the central target amino acid and flanking homology to the GFP 10 Ml 
in the context of the cloning vector. A partially soluble protein HPS was then inserted 
in the NdehBamHl cloning site to obtain (GFP10 mutant Ml)-ll-HPS-l2-(GFPll 
mutant Ml) in pTET SpecR vector. After three rounds of selection using the 
sequential induction format from the pTET and pET plasmids 16 , one best-performing 
clone, termed (GFP10 M2)-Ll-Mfel::HPS::BamHl-L2-(GFPll M4) was isolated. 
GFP10 M2-HPS-GFP11 M4 fusion and GFP1-9 Ml inserts were swapped between 
pTET and pET plasmids to perform directed evolution of GFP 1-9 Ml. Briefly, 
cDNAs libraries of GFP 1-9 were expressed from pTET SpecR plasmids and tested for 
in vivo complementation assays in an E. coli strain expressing GFP 10 M2-HPS- 
GFP11 M4 fusion in pET pl5. At the second round, GFP1-9 OPT was isolated and 
sequenced. 
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GFP1-9 in vitro assays. Production of recombinant GFP1-9 OPT protein fragment 
was performed according to previous protocol described for split- GFP 1-10 16 . GFP 1- 
9 was expressed from a pET pl5 vector without a 6His tag and produced after 
induction 1 mM IPTG for 4 hours at 37°C. The protein was refolded from washed 
inclusions bodies (see GFP1-10 protocol 16 ) and solubilized in TNG buffer. To 
perform kinetic characterization of the newly engineered GFP 1-9 OPT and GFP 1-9 
Ml, 180 ul of equal amounts of GFP 1-9 refolded pellet fractions (ca. 0.5 mg/ml) were 
mixed with 20 ul of a soluble protein control sulfite reductase (SR) in fusion with 
sandwich GFP10 and GFP11 (GFP10-SR-GFP11) or GFP10-11 peptide (3.5 uM 
each). For Kl/El and El/El kinetic saturation studies, 25 ul of E1-GFP11 (6.25 uM) 
were mixed with 50 ul of GFP10-K1 or GFP10-E1 at various concentrations (800 nM 
down to 0.4 nM). For ligand induced interactions, rapamycin (LC laboratories) was 
diluted in DMSO and added in 10 ul aliquot to the final FRB/FKBP protein assay mix 
20 ul of GFP10-FRB + 20 ul FKBP-GFP11 (0.5 mg/ml). Assay was repeated by 
diluting FRB/FKBP samples in TNG buffer. Complementation was induced by 
addition of a large excess of GFP 1-9 OPT (150 ul, 0.25 mg/ml). Fluorescence kinetic 
(^exc = 488 nm/X em = 530 nm) was monitored with a FL600 Microplate 
Fluorescence Reader (Bio-Tek), at 3 min intervals, for 15 h. The background 
fluorescence of a blank sample (150 ul of GFP 1-9 OPT, 0.25 mg/ml and 50 ul 0.5% 
(w/v) BSA in TNG buffer) was subtracted from the final fluorescence values. 

Mammalian cell interactions assays. CHO cells were grown in Ham's F-12 medium 
(Gibco, Invitrogen Co) and 10% (v/v) fetal bovine serum (FBS); HEK 293 cells were 
grown in Dulbecco's Modified Eagle Medium (DMEM) and 10% (v/v) FBS. CHO 
cells were co-transfected with Lipofectamine 2000 (Gibco, Invitrogen Co.) with 
plasmids encoding for GFP1-9 OPT, GCN4-GFP1 1, GFP10-GCN4 or GCN4-GFP1 1 
M4+ GFP 1-10. Stable GFP 1-9 cell lines were produced by lentiviral transduction of 
HEK 293. HEK 293_GFPl-9 cells were co-transfected with jetPRIME reagent 
(Polyplus transfection) with 0.5 ug of each plasmid (GFP10 and GFP11 fusions); 
Twenty-four hours after transfection, cells were washed with PBS and mounted in 
DAPI- containing Pro Long Gold antifading reagent (Molecular Probes). Imaging was 
performed using a LEICA DM-RB fluorescence microscope, with a 40 X and 100X oil 
immersion objective to visualize the stained cells. Images from CHO cells were 
acquired with a Photometric CoolSNAP HQ camera and analysed with Metamorph 
or ImageJ softwares. Imaging of HEK cells was performed using a Zeiss (Carl Zeiss) 
confocal laser scanning inverted microscope (LSM 710 NLO with Quazar spectral 
detector array). Flow cytometry measurements were performed using a BD 
Biosciences FACSCalibur™ cytometer. Data analysis was performed using 
CellQwest® software (BD Biosciences). 
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